Study of Overlapped Speech Detection for NIST SRE Summed Channel Speaker Recognition
نویسندگان
چکیده
This paper studies the overlapped speech detection for improving the performance of the summed channel speaker recognition system in NIST Speaker Recognition Evaluation (SRE). The speaker recognition system includes four main modules: voice activity detection, speaker diarization, overlapped speaker detection and speaker recognition. We adopt a GMM based overlapped speaker detection system, by using entropy, MFCC and LPC features, to remove the overlapped segments in summed channel test condition. With the overlapped speech detection, the speaker diarization achieves a relative 18% diarization error rate reduction for the 2008 NIST SRE summed channel test set, and we obtain relative equal error rate reductions of 13.3% and 9.4% in speaker recognition on the 1conv-summed task and 8convsummed task, respectively.
منابع مشابه
The IIR NIST SRE 2008 and 2010 summed channel speaker recognition systems
This paper reports the IIR speaker recognition system for the summed channel evaluation tasks in the NIST SRE 2008 and 2010. The system includes three main modules: voice activity detection, speaker diarization and speaker recognition. The front-end process employs a voice activity detection algorithm for effective speech frame selection. The speaker diarization system that was developed for 20...
متن کاملThe NIST SRE summed channel speaker recognition system
This paper presents an improved speaker recognition system for the summed channel evaluation tasks in the 2008 NIST SRE (SRE08) with multiple summed-channel excerpts for speaker training and one summed-channel excerpt for testing. The system includes three main modules in which a hybrid speaker purification and clustering algorithm is adopted to segregate the summed-channel speech, a common spe...
متن کاملSpeaker Verification On Summed-Channel Conditions With Confidence Measures
This paper addresses the problem of speaker verification in two speaker conversations, proposing a set of confidence measures to assess the quality of a given speaker segmentation. We study how these measures can be used to estimate the performance of a state-of-the-art speaker verification system, the I3A submission for the core-summed condition in the NIST SRE 2010. We present a Factor Analys...
متن کاملSpeaker Verification on Summed-Channel Conditions with Confidence Measures Verificación de locutor en condiciones de canal sumado con medidas de confianza
This paper addresses the problem of speaker verification in two speaker conversations, proposing a set of confidence measures to assess the quality of a given speaker segmentation. We study how these measures can be used to estimate the performance of a state-of-the-art speaker verification system, the I3A submission for the core-summed condition in the NIST SRE 2010. We present a Factor Analys...
متن کاملThe 1999 NIST speaker recognition evaluation, using summed two-channel telephone data for speaker detection and speaker tracking
The 1999 NIST Speaker Recognition Evaluation encompassed three tasks: one-speaker detection, two-speaker detection, and speaker tracking. All tasks were performed in the context of conversational telephone speech. The one-speaker task used single channel mu-law data; the other tasks used summed twochannel data. Twelve sites from the United States, Europe, and India participated in the evaluatio...
متن کامل